Building a Knowledge base through Open Information Extraction techniques
ثبت نشده
چکیده
The emergence of the Open-IE paradigm has given way to domain independent techniques for information extraction. Its capability of extraction new relations from text shows potential towards the task of building a knowledge base. However, its advantages also come with its disadvantages such as the lack of context expressed by the relations extracted by Open-IE systems. This research attempts to tackle the different problems associated with the incorporation of Open-IE towards the building of a knowledge base.
منابع مشابه
Towards Building Open Knowledge Base From Programming Question-Answering Communities
In this paper, we propose the first system, so-called Open Programming Knowledge Extraction (OPKE), to automatically extract knowledge from programming Question-Answering (QA) communities. OPKE is the first step of building a programming-centric knowledge base. Data mining and Natural Language Processing techniques are leveraged to identify duplicate questions and construct structured informati...
متن کاملInformation Extraction over Structured Data: Question Answering with Freebase
Answering natural language questions using the Freebase knowledge base has recently been explored as a platform for advancing the state of the art in open domain semantic parsing. Those efforts map questions to sophisticated meaning representations that are then attempted to be matched against viable answer candidates in the knowledge base. Here we show that relatively modest information extrac...
متن کاملKnowledge Assimilation and Web Deployment Techniques for Conversational Agents
We describe techniques for building conversational agents that integerate online and offline knowledge bases with logical inference engines. Information sources such as the WordNet knowledge base and a Web content extraction agent using Google’s Web-search API cooperate in a distributed multi-agent component environment. Our agents interact with users over the Web, as voice-enabled animated cha...
متن کاملOpportunities and Challenges Presented by Wikidata in the Context of Biocuration
Wikidata is a world readable and writable knowledge base maintained by the Wikimedia Foundation. It offers the opportunity to collaboratively construct a fully open access knowledge graph spanning biology, medicine, and all other domains of knowledge. To meet this potential, social and technical challenges must be overcome many of which are familiar to the biocuration community. These include c...
متن کاملDependency-Based Open Information Extraction
Building shallow semantic representations from text corpora is the first step to perform more complex tasks such as text entailment, enrichment of knowledge bases, or question answering. Open Information Extraction (OIE) is a recent unsupervised strategy to extract billions of basic assertions from massive corpora, which can be considered as being a shallow semantic representation of those corp...
متن کامل